Palindromic richness and Coxeter groups 



Edita Pelantova 
Czech Technical University 



m 



Prague 



Czech Republic 

Stepan Starosta 



Czech Technical University 
Czech Republic 



m 



Prague 



Abstract 



For a given finite group G consisting of morphisms and antimorphisms of a free 
monoid A*, we study infinite words with language closed under the group G. We 
focus on words rich in generalized palindromic factors, i.e., in factors w satisfying 
Q{w) = w for some non-identical element B G G. We give several equivalent de- 
scriptions of rich words and show the role Coxeter groups play in the generalized 
notion of palindromic richness. 



In [15], Droubay et al. showed that the number of different paUndromes occurring in a 
finite word w cannot exceed the bound + 1. If this bound is met, the word w is called 
rich or rich in palindromes or full [15, 7]. An infinite word u is said to be rich if any its 
factor w is rich. The Ust of the most prominent rich words contains Arnoux-Rauzy words 
and words coding interval exchange with symmetric permutation of intervals (note that 
Sturmian words belong to both mentioned groups of words) . 

During the past two decades, the notion of palindromic richness showed to be fruitful. 
Application of palindromes in physics of quasicrystals (see for instance [17, 13]) and in 
genetics (see for instance [20]) served as stimulating factor for research in this well. 
Recently Restivo and Rosone [25] showed a narrow connection of rich periodic words with 
extremal cases of Burrows- Wheeler transform which is used in compression algorithms. 

Generalizations of rich words appeared soon. Instead of classical palindromes defined 
as words invariant under the reversal mapping, one considers Q -palindromes, i.e., words 
invariant under an involutive antimorphism 0. For first appearance of the notion see 
[19], where it appeared in the context of DNA, or [14], where the name pseudopalindrome 
is also used. Words saturated by 0-palindromes up to the highest possible level are 
called Q-rich. Another kind of generalization of rich words relaxes the requirement on 
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the number of palindromes occurring in any factor w. We say that infinite word u is 
almost rich, if there exists a constant D such that any factor w of u contains at least 
1^1 + 1 — 1) different palindromes. The minimal constant D with this property is referred 
to as palindromic defect and was introduced in [7] . Both mentioned generalizations can be 
combined into the notion of almost Q-rich words. It follows directly from the definition 
that almost 0-rich words contain infinitely many 0-palin dromes. 

As shown in [23], besides peculiar periodic words, no infinite uniformly recurrent words 
can be simultaneously almost ©i-rich and almost ©2-rich for two different involutive an- 
timorphisms Gi and 02. The famous Thue-Morse word contains infinitely many classical 
palindromes and 0-palindromes, where is antimorphism generated by interchange of 
symbols and 1. Nevertheless, the Thue-Morse word has no chance to be almost rich 
or almost 0-rich. Therefore, the authors have suggested in [23] a further generalization 
under the name of G -richness. The new definition of richness respects all antimorphisms 
of finite order under which the language of an infinite word is invariant, the letter G 
stands for the group generated by these antimorphisms. The definition is based on the 
notion of graph of symmetries, which is assigned to an infinite word. Adopting the new 
definition, the second author showed in [26] that all generalized Thue-Morse words t^.m 
are Dm-rich, where Dm is a group isomorphic to the dihedral group having 2m elements. 

It turned out that words rich in the classical sense can be characterized by using many 
other notions of combinatorics on words: return word, bilateral order, longest palindromic 
suffix, factor and palindromic complexity, and Rauzy graphs. These characterizations 
can be found in [16, 10, 3]. Analogous results for 0-rich words can be found in [27]. 
The aim of this article is to find G-analogies of these characterizations. They are stated 
as Theorems 13, 20 and 31 and Proposition 35. An important consequence of these 
characterizations is the fact that existence of a G-rich word forces the group G to be a 
finite Coxeter group, i.e., a group generated by involutive elements only, for definition 
see for instance [18]. The question whether for any finite Coxeter group G there exists a 
G-rich word remains open. At the end of the article we present two examples of G-rich 
word, in which G is a Coxeter group not isomorphic to a dihedral group. 

2 Preliminaries 

An alphabet ^ is a finite set. Elements of A are usually called letters. A finite word w 
over ^ is a finite string w = wiW2 . . .Wn of letters Wi G A. Its length, denoted by 1^1, 
equals n. The set of all finite words over A equipped with the operation of concatenation 
is the free monoid A*. Its neutral element is the empty word e. A word v & A* is a factor 
of a word w E A* if there exist words s,t G A* such that w — svt. If s — e, then v is a 
prefix of w, if t — e, then v is a suffix of w. 

2.1 Antimorphisms and their fixed points 

A mapping (p on A* is called 
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• a morphism if ip{vw) = Lp{v)ip{w) for any v,w & A*] 

• an antimorphism if if{vw) — ip{w)ip{v) for any v,w & A*. 

We denote the set of all morphisms and antimorphisms on A* by AM (A*). Together with 
composition, it forms a monoid with the identity mapping Id as the unit element. The set 
of all morphisms, denoted by M{A*), is a submonoid of AM{A*). The reversal mapping 
R defined by 

R{WiW2 ...Wn)= WnWn-1 ■ ■ ■ W2W1 

is an involutive antimorphism, i.e., = Id. It is obvious that any antimorphism is a 
composition of R and a morphism. Thus 

AM{A*) = M{A*) U R{M{A*)). 

A fixed point of a given antimorphism © is called Q -palindrome, i.e., a word w is a 
G-palindrome if w = 0(w). If is the reversal mapping R, we say palindrome or classical 
palindrome instead of i?-palindrome. One can see that if O has a fixed point containing 
all the letters of A, then is an involution, and thus a composition of R and an involutive 
permutation of letters. 

2.2 Factor and palindromic complexities 

An infinite word u over an alphabet .4 is a sequence (Mn)neN ^ A^. We always implicitly 
suppose that A is the smallest possible alphabet for u, i.e., any letter of A occurs at least 
once in u. Action of any morphism (p e M{A*) can be naturally extended to infinite 
words by the prescription 

ip{u) = (f{uo)Lp{ui)Lp{u2) ... for all u = (M„)„eN e 

A finite word w is a factor of u if there exists an index i eN, called occurrence of w, such 
that w = UiUi+i . . . The set of all factors of u of length n is denoted >C„(u). The 

language of an infinite word u is the set of all its factors C{u) = U„gN£„(u). An infinite 
word u is recurrent if any its factor has at least two occurrences in u. A factor v e >C(u) 
is a complete return word of a factor -u; if -u; occurs in v exactly twice, as a suffix and a 
prefix oi V. A complete return word f of w can be written as v = qw for some factor 
q 7^ e, which is usually called a return word of w. If any factor w of u has only finitely 
many return words, then u is said to be uniformly recurrent. 

The factor complexity of u is the mapping C : N i-> N defined by the prescription 

C{n) #/:„(u). 

To evaluate the factor complexity of u, one may watch possible prolongations of factors. 

A letter a G ^ is a left extension of a factor in u if aw belongs to £(u). The set 
of all left extensions of w is denoted Lext(w). A factor w G C{u) is called left special, 
if #Lext(w) > 2. Analogously, we define right extension, the set Rext(u'), and right 



3 



special. If w is right and left special, we call it bispecial. The first difference of the factor 
complexity of a recurrent word u satisfies 

AC{n) = C{n + 1) - C{n) = ^ (#Lext(w) - l) = ^ (#Rext(w) - l). 

wec„{u) weCniu) 

The second difference of factor complexity can be expressed using the bilateral order of a 
factor. Let be a factor of u. Its bilateral order is the quantity b(w) := ^{awb \ awb G 
£(u), a,beA}- #Lext(w) - #Rext(w) + 1. In [12], the formula 

A'C(n) = AC(n + 1) - AC(n) = ^ bH 

is deduced. 

The Q-palindromic complexity of u is the mapping VQ{n) : N i->- N defined by 

Vein) := #{w G >C„(u) | w = e{w)}. 

Similarly to factor complexity, evaluation of palindromic complexity is based on counting 

possible extensions of palindromes. For a 0-palindrome w, we denote by Pexte(i(;) the 
set of all letters a & A such that awQ{a) e >C(u). It is easy to see that 

Vein + 2)- Vein) ^ J2 (#PcxteH-l). (1) 

w£C„(u) 

w=e{w) 

2.3 Words with language closed under a group G and G-richness 

In the sequel, the symbol G stands exclusively for a finite subgroup of AMiA*) containing 
at least one antimorphism. The set of all involutive antimorphisms belonging to G is 
denoted by G^'^\ We say that finite words w,v e A* are G -equivalent if there exists 
II & G such that w — /liv). The class of equivalence containing a word w is denoted 

[w] := {^liw) \iieG}. 

Since the group G is finite, any /i & G preserves length of words and thus equivalent words 
have the same length. 

We say that language >C(u) of an infinite word u G A^ is closed under G if for any 
factor w e >C(u) and any E G, the image belongs to C{u) as well. Since G contains 
at least one antimorphism, closedness of >C(u) under G implies that u is recurrent. 

The following graph assigned to such a word u was defined in [23]. It generalizes the 
notion of super reduced Rauzy graph introduced in [10]. 

Definition 1. Let u be an infinite word with language closed under G and n e N. 
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1) The directed graph of symmetries of the word u is I^„(u) = {V,^) with the set of 

vertices 

V — {[w] \ w & >C„(u), t/; is left or right special} 
and an edge e G ^ C C{u) starts in the vertex [w] and ends in the vertex [v], if 

• the prefix of e of length n belongs to [w] , 

• the suffix of e of length n belongs to [v], 

• e has exactly two occurrences of special factors of length n. 

2) The undirected graph of symmetries of the word u is r„(u) = (V, E) with the same set 

of vertices as T^„(u) and for any e E >C(u) we have 

[e]eE <^ e e ^. 

Any vertex [w] of the graph T'n(u) represents a class of equivalent factors of >C„(u). 
It has at most elements; the actual cardinality of [w] may depend on n as well. 

Definition 2. Let u be an infinite word with language closed under G. We say that a 
number n is G- distinguishing on u if for any w G jCn{u) it holds: 

©1 7^ ©2 =^ Qi{w) Q2{'w) for any two antimorphisms Qi,Q2 & G. (2) 

If n is G-distinguishing on u, then the knowledge of a pair w and Q{w) for a single 
word w G Cn{u) enables us to determine ©. Let us stress that the requirement (2) gives 
also for any two distinct morphisms 01, 02 G G that 7^ 02 ("U^)- Thus, n is the length 

of words on which action of any element of the group G is fully demonstrated. 

One can readily see that if n is G-distinguishing on u, then any m > n is also G- 
distinguishing on u. 

As proved in [23], connectivity of graphs of symmetries enables to bind the factor and 
palindromic complexities. 

Theorem 3. Let u be an infinite word with language is closed under G and N & N be 
G -distinguishing on u. Then 

AC(n) + #G > (Ve{n) + Ve{n+1)^ foranyn>N. (3) 

eeG(2) 

The term AC(n) + #G represents an upper bound on the number of palindromes 
occurring in u. Words for which this upper bound is reached are in some sense opulent 
in palindromes. Proof of Theorem 3 suggested us to adopt the following definition of 
Property G-graph(A^). 

Definition 4. We say that an infinite word u has Property G-graph{N) if for each n 
n > N it holds 
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• C{u) is closed under G; 



• if [e] is a loop in r„(u); then e is a Q-palindrome for some © e G; 

• the graph obtained from r„(u) by removing loops is a tree. 

Definition 5. We say that u is G-rich if u has Property G-graph{l) and u is almost 
G-rich if there exists N such that u has Property G-graph{N) . 

In the next section, we explain legitimacy of the name G-richness, i.e., we show that 
the classical richness is contained in our new definition of richness as well. 

2.4 Palindromic richness in the classical sense 

Let us recall the origin of palindromic richness. In this section, we use the word palindrome 
for i?-palindrome where R denotes the reversal mapping, and we denote by Pal(u') the 
set of all palindromic factors of a word w including the empty word e. In [15], Droubay, 
Justin and Pirillo provided the simple upper bound 



This bound serves for the definition of richness in the classical sense. 

Definition 6. A finite word w is called rich if the equality in (4) holds. An infinite word 
is rich if all its factors are rich. 

Another type of bound on number of palindromes contained in an infinite word was 
proved in [1]. If an infinite word has its language closed under the reversal mapping, then 
the following inequality holds 



The authors of [16] showed that an infinite word u with language closed under reversal is 
rich if and only if the equality in (5) is attained for all n G N. Their proof uses the notion 
of super reduced Rauzy graph, which, in our terminology, is the graph of symmetries 
r„(u) for the group G = {Id, R}. A word u is shown to be rich if and only if u satisfies - 
again in our terminology - Property G'-graph(l). 

In Section 6, we illustrate on the Thue-Morse word why Definition 5 of G-richness is 
based on Property G'-graph(l) instead of a modification of the inequality (4). 

Let us point out a drawback of our new definition. The Property G-graph(l), unlike the 
classical Definition 6 of richness, requires C{u) to be closed under reversal. Nevertheless, 
an infinite word can be rich in the classical sense without having its language closed 
under reversal. On the other hand, as proved in [16] (Proposition 2.11), any recurrent 
rich word has its language closed under reversal. Therefore, on the set of recurrent words 
both definitions coincide. The following theorem summarizes properties characterizing 
rich recurrent words, their proofs can be found in [16, 15, 10, 3]. 



#Pal(w) < |w| + 1. 



(4) 



AC(n) + 2 > V{n) + V{n + 1) for all ueN. 



(5) 
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Theorem 7. For an infinite word u with language closed under reversal the following 
statements are equivalent: 

1. VL is rich, 

2. any complete return word of any palindromic factor of u is a palindrome, 

3. for any factor w of u, every factor ofu that contains w only as its prefix and R{w) 
only as its suffix is a palindrome, 

4- the longest palindromic suffix of any factor w G >C(u) is unioccurrent in w, 

5. for each n the following equality holds 

C{n + 1) - C{n) + 2 = V{n) + V{n + 1), 

6. each graph of symmetries r„(u) satisfies: all its loops are palindromes and the graph 
obtained from r„(u) hy removing loops is a tree, 

7. any bispecial factor w of u satisfies: 

• if w is non-palindromic, then 

h{w) = 0, 

• if w is a palindrome, then 

h{w) = #Pext(w) - 1. 

Another characterization of rich words can be found in [11]. Almost rich words were 
introduced in [16]. The definition uses the notion of defect of an infinite word, which is 
based on the inequality (4). Therefore, we once more postpone the discussion about a 
relation between the notion of almost G-almost richness and almost richness to Section 6. 
Words almost rich in the classical sense can be characterized by properties analogous to 
those listed in Theorem 7. For more details see [4]. 

As already mentioned, in the articles [19, 14], the reversal mapping R is replaced by an 
arbitrary involutive antimorphism © and ©-palindrome is defined as a word v satisfying 
©(^;) = V. Let us denote by Pale(w) the set of ©-pahndromic factors occurring in w. As 
shown in [27], 

#PaleH < |w| + l-70H, (6) 

where 7e(w) := # {{a, ©(a)} \ a E A.a occurs in w and a ^ 0(a)}. Analogously to the 
classical richness, ©-richness is introduced. A finite word w is called 0-rich if the equality 
in (6) holds. An infinite word is ©-rich if all its factors are ©-rich. Generalizations of 
some characterizations in Theorem 7 for ©-rich words are presented in [27] as well. 
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3 Tools for characterization of G-rich words 



Classical rich and almost rich words can be described using the notions of return words 
and longest palindromic suffix. In order to find a suitable description of G-richness, we 
at first introduce G-analogies of these notions and in the sequel we demonstrate their 
efficiency. Let us recall that in all definitions and statements in the sequel, the symbol G 
stands for a finite subgroup of AM{A*) which contains at least one antimorphism. 

Definition 8. A word w & A* is said to he G -palindrome if there exists an antimorphism 
e G such that w — Q{w). 

Definition 9. Let w,v G A*. G-occurrence of a word w in a word v is an index i such 
that there exists w' G [w] having occurrence i in v. 

We say that w is G-unioccurrent in v ifw occurs in v and there is no other G-occurrence 
of w in V. 

Definition 10. Let u G A^ be an infinite word and w G >C(u). A factor v G C{n) of 
length \v\ > \w\ is called complete G -return word of [w] in u if 

• a prefix and a suffix of v belong to [w] and 

• V contains no other occurrence of a word from [w] . 

We say that v' G C{u) is a G-return word of [w] in u if for some w' G [w] the word 
v'w' is a complete G-return word of [w] . 

Definition 11. A suffix w of a word v & A* is called G-longest palindromic suffix of v if 

• w is a G-palindrome and 

• |w| > for any G -palindromic suffix w' of v. 

The G-longest palindromic suffix of v is denoted by G-\ps{v) . 

Remark 12. If no non-empty suffix w of a non-empty word f G ^* is a G-palindrome, then 
G-lps(i') = e. It may occur only in case when G does not contain the reversal mapping 
R. Let us recall that any index i G {1, 2, . . . , n} is defined to be an occurrence of e in 

V ^ V1V2. . .Vn-l- 

4 G-richness and G-return words 

In this section, we demonstrate that the notions of G-return word and G-palindrome can 
grasp the essence of G-richness as shown in the following theorem. 

Theorem 13. Let u be an infinite word with language closed under G. Then u is G-rich 
if and only if for all w G >C(u) every complete G-return word of [w] is a G-palindrome. 



8 



The statement for G = {Id, i?} is shown in [16] and for G = {Id, 0} in [27]. A 
statement analogous to Theorem 13 holds for almost G-richness as well and we prove it 
in this subsection as well. Let us recall that according to Definition 5, almost G-richness 
means property G-graph(iV) for some A^. First we introduce a new property based on 
G-return words which depends on a parameter N as well. 

Definition 14. Let N E N. We say that u e .4.^ satisfies Property G-crw{N) if for all 
w e C{u), \w\ > N, every complete G-return word of [w] is a G-palindrome. 

The proof of Theorem 13 is split into proofs of Lemmas 15 and 16 and Lemma 19. 

Lemma 15. Let N E N and u G satisfy Property G-graph{N). Let w be a factor of 
u such that \w\ > N and v be a complete G-return word of [w] in u starting in w. Then 
there exist a letter a & A and an antimorphism © e G such that wa is a prefix of v and 
Q{a)Q{w) is a suffix of v. 

Proof. Take n > N , w & >C„(u), and v a complete G-return word of [w] starting in w. 
Denote w' e [w] the suffix of v of length n. 

At first we suppose that w is a special factor of u. We consider the following two 
cases. We exploit the graphs of symmetries T'„(u) = {V, ^) and r„(u) = (V, E). 

1. If besides the prefix w and the suffix w' the complete G-return word v contains no 

other occurrences of a special factor of length n, then [v] is an edge in E which 
starts and ends in the same vertex [w] . The edge [v] is thus a loop and according to 
the definition of Property G-graph(A^), the factor v is a G-palindrome. Obviously, 
V has the property stated in the claim. 

2. Let V = VqVi . . . v„i contain a special factor z ^ [w\ of length n at the position i, 
i.e., z = Vi . . . We may suppose without loss of generality that i is the least 
index with this property. Then [z\ is a vertex of I^„(u) and a prefix of v is an edge 
in T^„(u) starting in [w] and ending in [z\. Since the graph obtained from r„(u) 
by removing loops is a tree, the complete G-return word v has a suffix / G ^ such 
that a prefix of / belongs to [z] and its suffix belongs to [w\. Moreover, there exists 
an antimorphism such that / = 0(e). As |e| = |/| > \'w\, the factor v has the 
property stated in the claim. 

Let us now suppose that w is not a special factor and thus w has a unique right 
extension, say a. If w' = Q{w) for some antimorphism G G, then Q{w) has a unique 
left extension 0(a) and therefore Q{a)Q{w) is a suffix of the complete G-return word v 
as stated in the claim. 

To finish the proof, it is enough to consider the situation when w' — n{w) for some 
morphism ji E G and w is not a special factor. We discuss two separate cases. 

1. There exists no special factor of length n. 

In this case u is periodic. Denote v' the word such that v = v' ii{w). Because no 
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special factor of length at least n exists, the factor v = v'fi{w) is the unique right 
prolongation of w of length \v\. As C{u) is closed under G, fi''{v) = fj,^{v')fj,^'^^{w) is 
the unique right prolongation of //{w) of given length. In particular, for £ = 1 it im- 
plies v'^{v')fi'^{w) G jC.{u). Repeating this argument for £ = 2, 3, . . ., we deduce that 
v'i^{v')fi'^{v') . . . iJ,^{v')iJ,^~^^{w) G £(u). Therefore, the factor . . 

where k is the order of the morphism /j,, is a period of u which does not contain any 
antimorphic image oi w - a contradiction. 

2. There exists a special factor of length n. 

Consequently, there exists a unique q such that wq is right special and no proper 
prefix of wq is right special. The factor wq has only one occurrence of a factor ^{w) 
for some morphism u E G - in the opposite case, we can find a shorter prolongation 
of w which is right special. Since v has suffix fJ^{w), we deduce \wq\ < \v\. As fi is 
a morphism, fi{w)fi{q) is the only right prolongation of fi{w) and thus vfi{q) is a 
complete G- return word of [wq] . For the special factor wq, we may now use the first 
part of the proof and thus find an antimorphism © such that ii{w)ii{q) = Q{wq) ~ 
Q{q)Q{w). Applying morphism we get wq — ii~^Q{q)fi~^Q{w). Together with 
the inequahty \wq\ < \v\, it implies a contradiction with the fact that v is aG return 
word of [w]. 

□ 

Lemma 16. Let u G and N eN. Ifu has Property G-graph{N), then it has Property 
G-crw(N). 

Proof. Let w be a complete G-return word of [w] starting in w for a factor w such that 
1^1 > N. Denote by a a letter such that wa is a prefix of v. 
If wa = V, then Lemma 15 implies that f is a G-palindrome. 

If wa 7^ V, then according to Lemma 15, f is a complete G-return word of [wa] as well. 
We apply the procedure again on wa. We find a letter b such that wab is a prefix of v. If 
wab = V, the t; is a G-palindrome, otherwise w is a complete G-return word of [wab]. 

We continue in this way until the procedure stops and we conclude that v is a 
G-palindrome. □ 

Remark 17. For an infinite word u satisfying Property G-graph(iV), the occurrences of 
morphic and antimorphic images oi w, ]w\ > N, alternate. This consequence of the 
previous claim is an analogy to the claims stated in [16] for rich words and in [27] for 
6-rich words. 

Theorem 18. Let there exist an almost G-rich word. Then G is a Coxeter group. 

Proof. At first suppose that the G-rich word u is periodic. According to [23], for a periodic 
word u the closedness of >C(u) under an antimorphism forces © to be involutive. Thus, 

all antimorphisms in G are involutive and therefore G is a Coxeter group. 

Now suppose that the G-rich word u is aperiodic. Let w be a prefix of a G-rich word u 
such that any letter of the alphabet occurs in w. According to Lemma 15, any occurrence 
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of a word from [w] is a 0-image of the left closest occurrence of a factor from [w], where 
© is an antimorphism. Since all letters occur in w, the antimorphism © is involutive. 
Therefore, any factor i/(u') occurring in u can be written as 1/(10) — ©i©2 . . . ©s('Ji'), 
where 0i02 ... ©^ is a sequence of involutive antimorphisms. As w contains all letters, 
the equality ulw) = ©i©2 ■ ■ ■ ©s(w) implies u = ©i©2 ... ©5. 

□ 

Lemma 19. Let u be an infinite word with language closed under G and N & N. If u 
satisfies Property G-crw{N), then it satisfies Property G-graph{N). 

Proof. Let n > N and w G £n(u). We assume that every complete G-return word of [w] 
is a G-palindrome. We have to show two properties of r„(u). 

1. Any loop in r„(u) is a G-palindrome. 

Since any loop e in r„(u) at a vertex [w] is a complete G- return word of [w], the 
loop e is a G-palindrome by our assumption. 

2. The graph obtained from r„(u) by removing loops is a tree. 

Or equivalently, we show that in r„(u) there exists unique path between any two 
different vertices [w'] and [w"] . Let p be a factor of u such that a prefix of p belongs 
to [w'], its suffix belongs to [w"] and p has no other occurrences of factor from [w'] 
or [w"] . Let without loss of generality w' be a prefix of p. Let us find a complete G- 
return word of [w'] with prefix p, denote it v. Since v is a G-palindrome, the factor 
©(p) is a suffix of V for some antimorphism © e C As is a complete G-return 
word of [w'], V has exactly two G-occurrences of w'. The factor v contains at least 
two G-occurrences of w". Therefore, the next factor with the same properties as p, 
i.e., representing a path in the undirected graph r„(u) between [w'] and [«/'], which 
occurs in u after p, is 0(p). Consequently, any factor with the same properties as p 
belongs to the same equivalence class [p] . 

□ 

5 G-richness and G-longest palindromic suffix 

As we have already stated, the classical richness is connected to the number of occurrences 
of the longest palindromic suffix in any factor. In the case of classical palindromes, the 
longest palindromic suffix of any word is always non-empty, but it is not always satisfied 
for the G-longest palindromic suffix. Therefore, the characterization of G-richness by the 
G-longest palindromic suffix needs a modification. The classical case of the following 
statement is established in [16] and for ©-richness in [27]. 

Theorem 20. Let u G be an infinite word with language closed under G. Then 
u is G-rich if and only if for any factor v G jC{u), its G-longest palindromic suffix is 
G-unioccurrent in v or the last letter of v is G-unioccurrent in v. 
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Again we prove a more general statement, since an analogous theorem holds for almost 
G-richness as well. At first we introduce a new property depending on a parameter A^. 

Definition 21. Let A^ e N. We say that u G .4^ satisfies Property G-lps{N) if for all 
w G C{u), \w\ > N, either the word G-lps(w) is G-unioccurrent in w, or the suffix of w 
of length 1 has exactly one G-occurrence in w. 

Lemma 22. Let u G . 

1. If xx has Property G-crw{l), then u has Property G-lps{l). 

2. Let A" G N. If u is uniformly recurrent and has Property G-crw{N), then there 
exists M G N such that u has Property G-lps{M) . 

Proof. Let us realize a trivial fact: if w G C{x\) has a suffix v which is not G-unioccurrent 
in then there is a suffix of w which is a complete G-return word of [v\. 

To prove the first assertion, consider a factor w G £(u). If the last letter of w, 
denoted a, is G-unioccurrent, we have nothing to do. If a is not G-unioccurrcnt in 
then according to the mentioned fact and by Property G-crw(l), a complete G-return 
word of [a] is a G-palindrome of length greater than 1. Therefore, G-lps(w) ^ e. We have 
to show that G-\ps{w) is G-unioccurrent in w. If not, then a suffix of w is a complete 
G-return word of [G-lps(u7)] which, according to Property G-crw(l), is a G-palindrome 
longer than the G-longest palindromic suffix of to - a contradiction. 

Now we prove the second assertion. Since u is uniformly recurrent, there exists an 
integer M such that every factor w G >C(u), \w\ > M, contains at least two occurrences of 
every factor v G £iv(u). It particular, it implies that w contains at least two G-occurrences 
of its suffix z of length A^. By the fact mentioned at the beginning of the proof. Property 
G-crw(A") implies that G-lps('u;) is longer than A" and is G-unioccurrent in w. 

□ 

Remark 23. Let us note that the second part of the previous lemma can be proved con- 
sidering a weaker assumption than uniform recurrence of u. It is enough to assume that 
every factor w G C{u) has only finitely many complete G-return words. 

Lemma 24. Let ue A^. 

1. Ifu has Property G-lps{l), then u has Property G-crw{l). 

2. Ifu has Property G-lps{N) with N > 1, then u has Property G-crw{N — 1). 

Proof. We prove both claims simultaneously. If A^ = 1, set M = 1. Otherwise set 
M = A" — 1. We prove by contradiction that u satisfies Property G-crw(M). 

Suppose there is a factor w G C{u), \w\ > M, such that there is a factor v G C{u) 
which is a complete G-return word of [w] and is not a G-palindrome. Denote p the prefix 
of u ending in the leftmost occurrence of a factor from [v]. It is clear that \p\ > N. 

Since p ends in a non-empty G-palindromic complete G-return word, the suffix of p 
of length 1 has at least two G-occurrences and thus Property G-lps(A^) assures that p 
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has a non-empty G-longest palindromic suffix which is G-unioccurrent. Let us denote 
X :— G-lps(p). 

If < |x| < then x has at least two G-occurrences in p - a contradiction. 

If \w\ < \x\ < \v\, then we can find a third G-occurrence of w in v - a, contradiction 
with V being a complete G- return word of [w]. 

If |a;| = |f I, then we have a contradiction with v not being a G-palindrome. 

If |a;| > \v\, then we can find a factor v' e [v] such that its occurrence is in contradiction 
with the choice of the prefix p. 

□ 

Remark 25. Again the assumptions of the previous lemma can be reduced, as it is visible 
in our proof. It is enough to require that any prefix w of u of length greater than or 
equal to has unique G-longest palindromic suffix or the last letter of the prefix v is 
G-unioccurrent in v. 

6 G-defect 

Richness in the classical sense is closely related to the notion of defect of a word. As 
introduced in [7], the defect of a finite word w is defined as follows 

D{w) = l^l + 1 - #Pal(w). 

The defect of an infinite word u is defined as 

D{u) = sup {D{w)}. 

The relation with richness is clear: a word is rich if its defect is zero and it is almost 
rich if its defect is finite. Since the defect of a finite word depends only on its length and 
number of palindromes contained in it, it is straightforward to see that for any a e ^ we 
have 

D{w) < D{wa) < D{w) + 1, D{w) < D{aw) < D{w) + 1, and D{w) = D{R{w)). 

The question we address here is how to define a G-analogy of defect when the group G 
contains more than two elements. Of course, we would like to find a definition of G-defect 
such that G-richness and almost G-richness would be again connected with G-defect in a 

reasonable way. 

Let us illustrate on the Thue-Morse word Utm the number of distinct G-palindromes 
contained in its factors. The Thue-Morse word u G {0, 1}^ is defined as the fixed point 
of the morphism (t(0) = 01 and cr(l) = 10 with the starting letter 0, i.e., 

Utm = 011010011001011010010110011010011001011001101001011010. . . 

The language of the Thue-Morse word is invariant under the reversal mapping R and under 
the antimorphism which permutes letters and 1. In [23], we showed that the Thue- 
Morse word is G-rich for G = {Id, i?, 0, i?0}. In Table 1, the numbers of G-palindromic 
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Table 1: Count of palindromes and 0-palindromes in the prefixes of the Thue- Morse word 
Utm- The prefix of Utm length n is denoted p„. 

factors of short prefixes of the Thue-Morse words are depicted. To generalize the notion 

of defect, the counting of G-palindromes must be replaced by counting the classes [w] of 
G-palindromes. Thus, we define the set, denoted PalG('u;), of all G-palindromic classes of 
equivalence in a finite word w as follows 

PalG(w) := {[v] I v is a factor of w and a G-palindrome} . 

Definition 26. Let w be a finite word. The G-defect of w is defined as 

Dg{w) := + 1 - #PalG(u') - 1g{w), 

where 

Iciw) = # {[a] I V6 G G*-^-*, 9(a) ^ a, a is a factor of . 
It follows from the definition that for all w & A* and e G it holds Dg{w) — 

The authors of [15] also observed that the classical richness of w can be characterized 
by so-called Property Ju: Any prefix of w has unioccurrent longest palindromic suffix. 
The notion of the longest palindromic suffix helps to calculate the defect of a word. For 
a word w and a letter a, one can see that 

^ . , \ Diw) , if wa has unioccurrent longest palindromic suffix 

D[wa) = < 

I D[w) + 1 , otherwise. 

Therefore, the defect of a finite word w — W1W2 ■ ■ - Wn equals to the number of indices i 
for which W1W2 . . . Wi^iWi does not have a unioccurrent longest palindromic suffix. Such 
indices are in [6] called lacunas and in [16] defective positions. Inspired by this, we adopt 
the following definition of G-defective positions. 
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Definition 27. Let w = Wi . . .Wn G A. An integer 1 < i < n is called G-defective 
position in w if Wi and G-lps(wi . . . Wi) are not G-unioccurrent in w\. . . Wi. Otherwise, i 
is called a G -non- defective position. 

The next lemma follows from comparing the last definition and the definition of 
G-defect. 

Lemma 28. Let w e A, then 

Dg{w) — #{1 <i<n\iisa G-defective position in w}. 
Moreover, it can be easily shown that the following relations are preserved: 
Dg{w) < Daiwa) < Dg{w) + 1 and Dg{w) < DG(aw) < Dg(w) + 1 
for all w E A* and a E A. Therefore, we can define G-dcfcct of an infinite word. 

Definition 29. Let u be an infinite word. The G-defect of \x, denoted Dg{vl), is defined 
as 

L'g(u) := sup {Dg{w)}. 

The immediate connection with Property G-lps(A'^) is summarized in the following 
lemma. 

Lemma 30. Let u he an infinite word with language closed under G. 

1. Dg{vl) = if and only if u satisfies Property G-lps{l). 

2. If there exists an integer N such that u satisfies Property G-lps{N), then Dg{u) is 
finite. 

3. Ifu is uniformly recurrent and Dg{vl) is finite, then there exists an integer N such 
that u satisfies Property G-lps{N) . 

Proof. The first two claims follow from Lemma 28. 

To show the last claim, suppose -Dg(u) to be finite. Then there exists a prefix v :— 
UqUi . . .um-1 of u such that Dg{u) = Dg{v) and any letter of the alphabet occurs in 
V. As u is uniformly recurrent, there exists a constant such that any factor w of u of 
length at least — 1 contains the prefix v as its factor. Using the definition and basic 
properties of G-defect, and maximality of Dg{v), we obtain Dg{v) = Dg{w) = Dciwa) 
for any a E A such that wa e C{n). Therefore, the last position in wa is not G-defective, 
i.e., u has Property G-lps(A^). 

□ 

It remains to connect G-defect with (almost) G-richness. 

Theorem 31. Let u he an infinite word with language closed under G. 
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1. -Dg(u) = if and only if u is G-rich. 



2. If u is uniformly recurrent, then Dg{vl) is finite if and only if u is almost G-rich. 

Proof. The first part follows from Lemma 30 and Theorem 20. 

To show the second part, one can see that it follows from Lemma 30 that u satisfies 
Property G'-lps(A^) for some if and only if -Dg(u) is finite. We can then use Lemmas 22 
and 24 to get equivalence with having Property G-crw{N') for some A^'. Finally, we use 
Lemmas 16 and 19 to prove the equivalence with having Property G-graph(iV') which is 
by definition equivalent with almost G- richness of u. 



7 G-richness and bilateral order 

As we have state in Theorem 7, point 7, words rich in classical sense can be characterized 
using bilateral order of bispecial factors. The proof of this fact given in [3] is based on the 
vahdity of point 5 of Theorem 7. In [23], a weaker form of G-analogy of this statement is 
given. 

Proposition 32. Let u be an infinite word with language closed under G, M be a G- 
distinguishing number on u, and N be an integer N > M. The word u satisfies Property 
G-graph{N) if and only if 



Unfortunately, we have no modification of the previous proposition describing Prop- 
erty G-graph(A^) for N which are not G-distinguishing. Therefore, in this section we 
concentrate on the notion almost G-richness. Let us rephrase the previous proposition in 
a more handy way. 

Corollary 33. Let u be an infinite word with language closed under G, M be a G- 
distinguishing number on n, and N be an integer N > M. Then u satisfies Property 
G-graph{N) if and only if 



□ 




for any n> N. 



(7) 



eeG(2) 



1. 




GeG(2) 



2. and for all n > N , we have 



E (#PexteH-l). 



eeG(2) M)e£„(u) 

w=Q{w) 
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Proof. The task to verify equalities a{n) = h{n) for all n > N means to verify a{N) = b{N) 
and Aa{n) = Ab{n) for all n> N. Let us consider a{n) to be equal to the left-hand side 
and b{n) to the right-hand side of (7). It is now enough to reahze that 

A6(n)= J] P0(n + 2)-PeH= J] J] (#Pext0H-l), 
GeG(2) eeG(2) w&c„{ii) 

w=Q{w) 

where we used equality (1). □ 

Proposition 34. Let e N, u e satisfy G-graph{N), and w be a bispecial factor of 
u of length at least N. 

• If w is not a G-palindrome, then 

h{w) > 0. 

• If w is a Q-palindrome for an antimorphism e G, then 

h{w) > #PextG(w) - 1. 

Proof. Let w be a bispecial factor having its length M := 1^1 > N such that for all 
antimorphism © e G, Q{w) ^ w. By the definition of b(w), we want to prove 

#Bext(w) > #Rext(w) + #Lext(w) - 1. (8) 

Let B{w) be a bipartite graph with the set of vertices 

V{w) = {aw I a e Lext(w)} U {wb \ b e Rext(w)} . 

There is an edge connecting vertices aw and wb if the word awb is a factor of u. The 

number of vertices in the graph B{w) is 7^Rext(i(;) -|- 7^Lext(-u;) and the number of edges 
is 7^Bext(u'). In the sequel, we show that this graph is connected. Since in any connected 
graph the number of edges equals at least the number of vertices minus one, the inequality 
(8) follows. 

Let be an increasing sequence of indices such that /cq > is an occurrence of 
w and kn is an G-occurrence of w for any n > 1. Moreover, any G-occurrence of w in 
the suffix MA:o'Wfco+i«A:o+2 • • • of u bclougs to the sequence (A;„). As u satisfies Property 
G-graph(A^), then according to Remark 17 we have 

Mfc^Mfcj+i . . . Mfc^+M-i = i^i(ii'), where i^i e G is an antimorphism, 

WfcjWfea+i ■ ■ ■ Wfc2+M-i = ^2{w), where ^2 G G is a morphism, 

WfcgWfeg+i . . . Wfc3+M-i = where 1/3 e G is an antimorphism, 

and so on. 

The restriction of e G to the set of letters is just a permutation. Therefore, for any 
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u ^ G and any b E A, there exists a letter a E A such that b = ^{a). Thus, for any n G N, 
the factor Uk„-iUk„Uk„+i . . . Uk^+M can be written as 

Ukn-i'^k„Uk„+i ■ ■ ■ Uk„+M = i^n{cn)i^n{w)^n{dn) for some letters c„ and (i„. 

As i'2i is a morphism and 1^21+1 is an antimorphism, we have 

C2iwd2i G £(u) and d2i-iWC2i-i G £(u). (9) 

Because w is not a G-palindrome, any G-occurrence of w together with the left and the 
right neighboring letters corresponds to a unique edge in the graph B{w). For any n € N, 
the factor 

Vn{w)Vn{dn) ■ ■ ■ Vn+l{Cn+l)l^n+l{w) 

is a complete G- return word of [w]. According to Lemma 15, there exists an antimorphism 
e G G such that Vn+i{w) = ^{i^n{w)) and ^'„+i(c„+i) = ©(i/„(d„)). As Vn+i{w) = 
©(i/„(w)) implies z/„+i = Qv^, we get Vn+iicn+i) = ©(i^n(cn+i)) = ©(i^n(c^n)) and thus 
Cn+i — dn- Using (9) we obtain 

C2iWC2i+i G jC{u) and C2iWC2i-i G jC(u). 

Recurrence of u implies 

V{w) = {c2iW I i G N} U {i(;c2i-i I i G N}. 

Walking along u, each G-occurrence of w represents an unordered edge connecting C2iW 
with wc2i-i or C2iW with wc2i+i, and thus in fact walking along u represents a walk in the 
graph B{w). Since any factor aw and wb occurs in £(u) infinitely many times, this walk 
in the graph B{w) uses all vertices of B{w). Therefore, the graph B{w) is connected. 

Now consider a G-palindromic bispecial factor w and denote by © the antimorphism 
such that w — Q{w). We define the bipartite graph B{w) in the same way as before. 
If awb G C{u) and b 7^ ©(a), then B{w) contains with the edge awb also the different 
edge Q{b)wQ{a). Therefore, any G-occurrence of w together with the left and the right 
neighboring letters corresponds to a pair of edges in the graph B{w) unless it represents 
a ©-palindromic extension awQ{a). Let us replace the graph B{w) by the graph B'{w) in 
which vertices are couples {a'W,'wQ{a)} and edges are either couples {awb,Q{b)wQ{a)} 
or loops representing a ©-palindromic extension {awQ{a)} . Now we can interpret a walk 
along U. clS cl walk in the new graph B'{w). Consequently, the graph B'{w) must be 
connected. The connectivity of B'{w) implies that the number of edges in B'{w) which 
are not loops is at least equal to #Rext(u') — 1. Since 

^{awb I 6 7^ ©(a)} = 2 X number of edges in B'{w) which are not loops, 

we obtain 

#Bext(w) = #{aw6 | b ^ ©(a)} + #Pext©(w) > 2(#Rext(w) - 1) + #Pexte(w) 
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As ^Rext{w) = i^Lext{w), we deduce 

b{w) = #Bext(w) - #Rext(^w) - #Lext(^/;) + 1 > #Pext0(w) - 1. 

□ 

Proposition 35. Let u 6e an infinite word with language closed under G, M he G- 
distinguishing on u, and N > M be an integer. Then u has Property G-graph{N) if and 
only if any bispecial factor w of u of length at least N satisfies: 

• if w is not a G -palindrome, then 

h{w) = 0, 

• if w is a Q-palindrome for some Q & G, then 

h{w) = #Pext0(w) - 1; 

and 

AC{N) + 4G = Yl {Ve{N)+Ve{N + l)y 

Proof. (<^): The assumption on bilateral orders and the fact that non-bispecial 0-pa- 
lindromic factors have a unique 0-palindromic extension guarantee the following equality 
for all n > N: 

A'dn)^ J2 bH= Yl E (#PexteH-l). (10) 
«)e£n(u) eeG(2) «)e£„(u) 

w=Q{w) 

According to Corollary 33 it implies that u satisfies Property G-graph(A'^). 
(^): Let n> N. Using Proposition 34 and Corollary 33 we obtain 

A^C(n)= bH= E bH+ E E bH> 

w^0{w) w=Q{w) 

for all 

^ E E (#PexteH - 1) = A^CH 

w=Q{w) 

As the beginning and the end of our estimates is the same, the inequahties for b(w) 
deduced in Proposition 34 must be equalities, which was to prove. 

□ 
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8 Examples 

It is well known that any finite Coxeter group is isomorphic to a group generated by 
reflexion symmetries of a polygon in M''. The dihedral groups, usually denoted Dm, are 
groups of symmetries of the regular m-gon in the plane. For example, the regular pentagon 
depicted on Figure 1 has five reflexion symmetries of order 2 and four rotation symmetries 
of order 5 and identity of order 1, together has 10 elements. In general, the group 
Djn has 2m elements. As shown in [26], the generalized Thue- Morse word tb^m for 3-11 
parameters b >2 and m > 1, already considered in [24], is D„-rich. 




Figure 1: Regular pentagon and its symmetries. 

In this section we provide 2 examples of G-rich words, where G is not a dihedral group. 
In the first example, the group is isomorphic to the group of symmetries of an orthogonal 
prism with rhomb base, see Figure 2a. In the second example, the group is isomorphic 
to the group of symmetries of two pyramids joint together by their rectangular bases, see 
Figure 2b. 

The first group, denoted G, is constructed on an 8-letter alphabet ^ := {0, 1, . . . , 7}. 
The antimorphisms Gq, ©i and ©2 are defined on A* as follows 

©0 : ^ 2, 1 ^ 1, 2 ^ 0, 3 ^ 3, 4 ^ 6, 5 ^ 5, 6 ^ 4, 7 ^ 7, 
©1 : 4, 1 ^ 5, 2 ^ 6, 3 ^ 7, 4 ^ 0, 5 ^ 1, 6 ^ 2, 7 ^ 3, 
©2 : ^ 0, 1 ^ 3, 2 ^ 2, 3 ^ 1, 4 ^ 4, 5 ^ 7, 6 ^ 6, 7 ^ 5. 

The group G C AM{A*) is the group generated by ©0, ©1 and ©2. 

The second group, denoted H, is on a 6-letter alphabet B :— {0, 1, ... , 5} and H C 
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Figure 2: (a) Symmetries of jC{u) from Example 36 represented by the symmetries of an 
orthogonal prism with rhomb base, (b) Symmetries of >C(v) from Example 37 represented 
by two pyramids joint together by their rectangular bases. 

AM{B*) is generated by the 3 following antimorphisms: 

*o : Oh^O, 1h^1,2h^4,3h^5,4h^2,5h^3, 
*i : 0h^1,1h^0,2h^2,3h^3,4h^4,5h^5, 
*2 : 0^0,1^1,2^3,3^2,4^5,5^4. 

Example 36. Let (f : A* ^ A* he a. morphism defined as 

: ^-> 01, 1 ^ 2, 2 ^ 65, 3 H> 4, 4 ^ 23, 5 ^ 6, 6 ^ 47, 7 ^ 0. 

Denote by u the fixed point of (f. In Section 8.1, we show that u has its language closed 
under G and u is G-rich. 

Example 37. Let fi : A* ^-^ B* he a morphism defined as 

// : ^ 15, 1 ^ 04, 2 ^ 12, 3 ^ 03, 4 ^ 04, 5 ^ 12, 6 ^ 03, 7 ^ 15. 

Let V = A*(u). In Section 8.2, we show that C{v) is closed under H and v is if- rich. 

The proofs of properties of u and v are split into several lemmas. Instead of their 
complete proofs, we provide just sketches or hints for readers. 

8.1 G-richness of u 

To prove that u defined in Example 36 is G-rich means to show that u has Property 
G'-graph(l). For this reason, we exploit Proposition 35. Therefore, one needs to study 
bispccial factors occurring in u. We use a general method for circular morphisms described 
in [21]. In the case of (f, it yields simple results. To describe the bispecial factors of u, 
we introduce the mapping tt : {0, 2, 4, 6} i-> {0, 2, 4, 6} as follows 

TT : 2,4 0,2 4,6 6. 
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Lemma 38. Let w = Wo...Wn^i be a non-empty bispecial factor of u. Then w„_i G 
{0,2,4,6} and Lp{w)7i{wn^i) is a bispecial factor of u. Moreover, h{w) = h {ip{iv)'n{wn-i) ) ■ 

Proof. The claim follows from the definition of 99 and the fact that right special factors 
of length 1 are factors 0, 2, 4 and 6, and each has 2 right extensions. □ 

The next statement can be easily deduced from the form of ip as well. 

Lemma 39. Letw, \w\ >2, he a bispecial factor ofu. Then there exists a unique bispecial 
factor of u, say v — vqVi . . . Vm-i, such that 

W = (p{v)'K{Vm-l)- (11) 

According to the last two lemmas, all bispecial factors can be constructed from bis- 
pecial factors of length 1 using recursively formula (11). In fact, all the letters of A are 
bispecial factors and G-palindromes. We show that the formula (11) produces from a 
G-palindrome again a G-palindrome. 

Lemma 40. For all i e Z3 and w e jC{u), w = Wq . . . Wn-i, we have 

where yi{w) := 0i((/3(wo))g and Xi{w) = ((^6i(w„_i))g and (f)^ denotes the first letter of 
a word v. 

Sketch of the proof. The proof is done by induction on n. Supposing the claim holds for 
n, one needs to deal with different cases according to the value i and possible factors 
Wn-iWn e £2(u) = {54,62,47,12,04,76,65,40,01,23,30,26}. The claim then follows 
from the definitions of </?, ©j and ©i_i. □ 

Lemma 41. Let w = Wo...Wn-i be a non-empty bispecial factor ofu. Then w is a 
Q-palindrome, Q & G, and h{w) — #Pexte(if) — 1. 

Proof. As the bilateral order of a bispecial factor of length 1 equals 0, according to Lem- 
mas 38 and 39, all bispecial factors have their bilateral order equal to 0. It is also clear 
that they have 2 right and 2 left extensions. 

li w — Wo ■ ■ -Wn-i is a non-empty bispecial factor and if w is a Oj_i-palindrome for 
i G Z3, one can show that 7[{wn-i) = yi{w) and thus, Lp{w)TT{wn-i) is a ©^-palindrome. 
Since the bispecial factors of length 1 are ©2-palindromes, all bispecial factors are ©j- 
palindromes for some i G Z3. 

It follows that for all i G Z3, £(u) contains infinitely many ©j-pahndromes. Therefore, 
>C(u) is closed under G. 

Let w be a non-empty bispecial factor. Since h{w) = 0, w has 2 left and 2 right 
extensions, w is a ©-palindrome for a unique © G G, and C{u) is closed under G, one can 
see that the number of ©-palindromic extensions of w is 1. □ 
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Proof of G -richness of u defined in Example 36. At first, we realize that the generators 
©0, ©1 and ©2 of the group G guarantee the number 1 to be G-distinguishing on any 
infinite word over A. Because of Lemma 41 and Proposition 35, it remains to verify that 
AC(1) + 4j^G equals the number of all G-palindromes of length 1 and 2. One can readily 
see that AC(1) = 4, jj^G = 8, the number of G-palindromes of length 1 is 8 and the 
number of G-palindromes of length 2 is 4. 

□ 

8.2 if-richness of v 

The proof of i/-richness of v is very similar to the previous proof. In order to use 
Proposition 35, we explore the bilateral orders and if-palindromic extensions of bispecial 
factors of v. 

We define the morphism rj : A* ^ B* as 

77 : 041, 1 ^ 120, 2 ^ 031, 3 ^ 150, 4 ^ 150, 5 ^ 041, 6 ^ 120, 7 ^ 031. 

Let w — wq. . .Wn-i e ^* be a non-empty factor of u. It follows from >C2(u) and the 
definition of fi that ^{w)r]{wn^i) is a factor of v. The following lemma summarizes the 
relation between the bispecial factors of v and of u. 

Lemma 42. Let w G C{u), w = wq . . .Wn-i, be a non-empty bispecial factor. Then 

IJ,{w)ri{wn-i) is a bispecial factor of \ . 

On the other hand, if v E \v\ >5, is a bispecial factor of\, then there exists a 

unique non-empty bispecial factor w e >C(u) such that v — ii{w)r]{wn-i). 

Lemma 43. Let i e Z3. If w & ^(u) is a non-empty Qi-palindrome, then the factor 
li{w)r]{wn-i) e >C(v) is a i-palindrome. 

Sketch of the proof. We induce on the length of w. Fix i e Z3. Suppose the claim holds 
ior w = Wq . . . Wn-i = Qiiw). Take z E A such that zwQi{z) e C{\x). The proof follows 
from the definition of //, 77, and possible factors zwq G C2{vl). □ 

Proof of H -richness of w defined in Example 37. According to the previous lemma, it is 
clear that C{\) is closed under H . The properties of C{vl) also imply that all bispecial fac- 
tors of V of length greater than or equal to 5 have bilateral order and one ©-palindromic 
extension, where © G if is the unique antimorphism fixing the bispecial factor. For 
shorter bispecial factors, of length greater than 1, this property needs to be verified by 
hand and is left to the reader. 

Since 2 is an ff-distinguishing number on v, Proposition 35 requires to evaluate 
AC(2), Vq{2) and Pe(3) for all involutive antimorphism Q E H. It is easy to verify 
that AC (2) = 4, Eee//(2) ^e(2) = and Eee/fC^) ^e(3) = 12. Since H^H = 8, according 
to Proposition 35, v satisfies Property i?-graph(2). 

To claim that v is ii-rich, we need to verify that v satisfies ii-graph(l). Thus, it 
remains to show that all loops in ri(v) are ii-palindromes and the graph obtained from 
ri(v) by removing loops is a tree. Since it can be easily verified by hand, the word v is 
ii-rich. □ 
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Denote for all i G Z3 by Hj the subgroup of H generated by and ^'(i+i mods)- 
It is easy to verify that = 4 for all i, the number 1 is iJo-distinguishing and Hi- 
distinguishing, and the number 2 is if2-distinguishing. It follows from the last proof that 
p^,(3) = 4 and Pm,,(2) = for all i E Z3. One can also verify that V^^i^) = ^^2(1) = 2 
and Vq,,{l) = 4. Since AC(1) = 2, the word v is i^o-rich, i^i-rich. Since AC(2) = 4, it is 
almost if2-rich (it is not H2-nch.). 

9 Comments and open problems 

• The dihedral groups form a special class of finite Coxeter groups. As shown in 
[26], for any dihedral group there exists a D^-rich word. Is it possible for any given 
finite Coxeter group G to find a G-rich word? 

• The list of examples of G-rich words is very short, the list of almost G-rich words 
(which are not G-rich) is empty. In [16], Glen et al. described a class of morphisms 
such that morphic image of a rich word under this morphism has a finite non-zero 
defect. Find a class of morphisms producing almost G-rich words with finite non- 
zero defect by applying this morphism to a G-rich word. 

• For a word u with language closed under reversal, Brlek and Reutenauer conjectured 
in [8] for the defect D{u) that 

2D(u) = ^r(n), where T{n) -.^ AC{n) + 2 - V{n + 1) - V{n). 

This equality was shown to be valid for periodic words in [8] and for uniformly 
recurrent words in [5]. 

Can the G-defect of an infinite word u be expressed using the differences between 
right-hand and left-hand sides of inequalities in (3)? 

• In Section 2.4, definitions of rich words and ©-rich words were reminded. In our 

new terminology, they are {Id, i?}-rich words and {Id, 0}-rich words respectively. 
The groups {Id, i?} and {Id, 0} are clearly isomorphic. In [9], it is shown that a 
so-called ©-standard word with seed, which is almost ©-rich, is a morphic image 
of a standard Arnoux-Rauzy words, which is rich. In [22], we have a more general 
case: any uniformly recurrent almost ©-rich word is a morphic image of a rich word. 
Is an almost Gi-rich word related to a G2-rich word for some group G2 isomorphic 
to Gi? 

• Let u be an infinite word having language closed under a group G. The closedness 
under G can be exploited to estimate the number of distinct frequencies of factors 
of the same length n. In [2], an upper bound on this number is given (for n being 
a G-distinguishing number). The estimate is based on the inequality from Theo- 
rem 3. Looking at the proof of the estimate, it can be seen that the only candidates 
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for reaching the upper bound for all sufficiently large n are almost G-rich words. 
However, as noted in [2], in our words, almost G-richness does not imply the upper 
bound to hold for all sufficiently large n. 

• The Thue-Morse word is G-rich, where G is generated by two commuting antimor- 
phisms R and ©. However, it is not Gi-rich while taking a proper subgroup Gi of 
G. In our considerations, we did not assume the group G to be the maximal group 
of symmetries such that an infinite word u is closed under G. 

Suppose u is an infinite word having language closed under a group G. Let Gi be 
a proper subgroup of G containing at least one antimorphism. Suppose u is both 
almost G-rich and almost Gi-rich. Let be a G-distinguishing number. Then, 
according to Proposition 32, we can for all n> N write 

AC(n) + #G = (Vein)+Ve{n + 1)^ and 

GeG(2) 

AC(n) + #Gi = Yl {Vein)+Ve{n + 1)). 

Thus, we get 

4j.G-4i^G^^ J2 {Pe{n)+Ve{n + 1)). (12) 
0eG(2)\Gf^ 

Since Gi is a proper subgroup of G, we have #G = ^#Gi for i > 1. Take © e 
G(2) \ Gf^ and w e >C(u), 1^1 > N, such that e{w) = w. Then for all v e [w], it 
can be show that there exists ^ G G*-^-* \ G^^'* such that f is a ^-palindrome. Since 
^[w] = the right-hand side of (12) equals k{n)^ for k{n) G N. We get from 
(12) that 

(£-l)#Gi = Mn)£^. 

The only solution is £ = 2 and k{n) — 1 for all n. Thus, we obtain the following 
condition 

#G'i = ^#G= (Ve{n)+Ve{n + l)) for all n > iV. 

GeG(2)\Gf^ 

Indeed, these conditions are satisfied for the three subgroups Hq, Hi and H2 of the 
group H and the word v, see the last part of Section 8.2. 

Further characterization of such group and examples of such infinite words is an 
open problem. 
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